Interactive Policy Learning through Confidence-Based Autonomy
نویسندگان
چکیده
We present Confidence-Based Autonomy (CBA), an interactive algorithm for policy learning from demonstration. The CBA algorithm consists of two components which take advantage of the complementary abilities of humans and computer agents. The first component, Confident Execution, enables the agent to identify states in which demonstration is required, to request a demonstration from the human teacher and to learn a policy based on the acquired data. The algorithm selects demonstrations based on a measure of action selection confidence, and our results show that using Confident Execution the agent requires fewer demonstrations to learn the policy than when demonstrations are selected by a human teacher. The second algorithmic component, Corrective Demonstration, enables the teacher to correct any mistakes made by the agent through additional demonstrations in order to improve the policy and future task performance. CBA and its individual components are compared and evaluated in a complex simulated driving domain. The complete CBA algorithm results in the best overall learning performance, successfully reproducing the behavior of the teacher while balancing the tradeoff between number of demonstrations and number of incorrect actions during learning.
منابع مشابه
Confidence-Based Robot Policy Learning from Demonstration
The problem of learning a policy, a task representation mapping from world states to actions, lies at the heart of many robotic applications. One approach to acquiring a task policy is learning from demonstration, an interactive technique in which a robot learns a policy based on example state to action mappings provided by a human teacher. This thesis introduces Confidence-Based Autonomy, a mi...
متن کاملScalability of Confidence-Based Autonomy Multi-Robot Demonstration Learning
In this paper, we present the first application of demonstration learning to more than two robots and perform an analysis of the scalability of the Confidence-Based Autonomy (CBA) multi-robot demonstration learning algorithm. Through experimental evaluation using up to seven Sony AIBO robots, we examine how the number of robots being taught by a human teacher at the same time affects the number...
متن کاملFlexible Demonstration Learning System for Variable Number of Robots
In this paper, we present flexMLfD, a robot independent and task independent demonstration learning system that supports a variable number of robot learners. Our approach is based on the Confidence-Based Autonomy (CBA) demonstration learning algorithm, which provides the means for a single robot to learn a task policy through interaction with a human teacher. The generalized representation and ...
متن کاملThe Impact of Fostering Learner Autonomy through Implementing Cooperative Learning Strategies on Inferential Reading Comprehension Ability of Iranian EFL Learners
Abstract The great shift of paradigm from teacher-centeredness to learner-centeredness has one major rationale in line with the definitions of autonomy, i.e., the capacity and willingness to act independently and in cooperation with others, so cooperation is looked upon as the manifestation of autonomy. In the present study, the researchers investigated the impact of training cooperative...
متن کاملInteractive Refinement of Control Policies for Autonomous Robots
The automation of various aspects of life through robotics is a promising and useful mechanism to the general enduser. Robots are required to accept human guidance, and in its absence, have to operate autonomously while ensuring safety and optimality. This paper presents an approach to variable autonomy that extends reinforcement learning with the capability of integrating user guidance at vary...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- J. Artif. Intell. Res.
دوره 34 شماره
صفحات -
تاریخ انتشار 2009